Preset Two-head Automata and Morphological Analysis of Natural Language*
ثبت نشده
چکیده
Modeling the morphological structure of natural languages in terms of a nondeterministic finite-state automaton is shown to be inadequate in its handling of some common natural language phenomena. We show that a two-tape nondeterministic automaton is capable of handling these phenomena. The modeling is improved by the specification of a new type of automaton, the preset two-head automaton, which we argue is equivalent in expressive power to a linear context-free grammar. We discuss the operation of a parser which implements the improved model.
منابع مشابه
Morphological Comparison of two populations of lake goby Rhinogobius similis Gill, 1859 from Hariroud basin
Knowledge on the fish species is important in habitat protection management. This study was conducted to compare the morphological characteristics of two populations of Rhinogobius similis from Hariroud basin based on landmark morphometric truss network system. A total of 60 individuals from Polkhatoun (30 specimens) and Tafrihgah dam (30 specimens) stations were caught by electrofishing 220 vo...
متن کاملVisits, Crosses, and Reversals for Nondeterministic Off-Line Machines
The different concepts involved in "reversal complexity"--counting reversals (sweeps), visits to a square, or crossing sequences--are discussed for nondeterministic off-line Turing machines with one working tape and for preset Turing machines, a generalization of two-way checking automata. Restriction to finite reversals or visits or crosses yields the same family, NSPACE(log2 n), for off-line ...
متن کاملApplications of Finite-State Transducers in Natural Language Processing
This paper is a review of some of the major applications of nite-state transducers in natural-language processing ranging from morphological analysis to nite-state parsing. The analysis and generation of in ected word forms can be performed e ciently by means of lexical transducers. Such transducers can be compiled using an extended regular-expression calculus with restriction and replacement o...
متن کاملFinite-State Registered Automata and Their Uses in Natural Languages
We extend finite state registered automata (FSRA) to account for medium-distance dependencies in natural languages. We provide an extended regular expression language whose expressions denote arbitrary FSRAs and use it to describe some morphological and phonological phenomena. We also define several dedicated operators which support an easy and efficient implementation of some non-trivial morph...
متن کاملDeveloping a New Approach for Arabic Morphological Analysis and Generation
Arabic morphological analysis is one of the essential stages in Arabic Natural Language Processing. In this paper we present an approach for Arabic morphological analysis. This approach is based on Arabic morphological automaton (AMAUT). The proposed technique uses a morphological database realized using XMODEL language. Arabic morphology represents a special type of morphological systems becau...
متن کامل